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Analytical Design of Multispectral Sensors 


DANIEL J WIERSMA, member, ieee, anu DAVID A LANDGREBE, fellow, iele 


Abstract -An analytical f .occdure for the design of the spectral chan- 
nels for multispectral remo(e sensor systems is defined. An optimal 
design based on the criterion of minimum mean-square representation 
enor using the Karhunen-Locvc expansion was developed to represent 
the spectral response functions from a stratum based upon a stochastic 
process scene model. From the overall pattern recognition svstem 
perspective the effect of the representation accuracy on a typical per- 
formance criterion, the probability of correct classification, is investi- 
gated. The optimum sensor design provides a standard against which 
practical (suboptimum) operational sensors can be compared. An ex- 
ample design is provided and its performance is illustrated. 

Although tht analytical technique was developed primarily for the 
purpose of sensor design it was found that the procedure has potential 
for making important contributions to scene understanding It was con- 
cluded that spectral channels which have narrow bandwidths relative to 
current sensor systems may be necessary to provide adequate spectral 
representation and improved classification performance. 

I Introduction 

A PATTERN recognition system as used in a remote 
sensing system for earth resources consists of three 
fundamental components— the scene, the sensor, and the pro- 
cessor (Fig 1) The scene is that portion of the earth’s surface 
observed by the sensor The desired information is contained 
in the spcctial, spatial, and temporal variations of the electro- 
magnetic energy emanating from the scene The sensor col- 
lects the energy and measures its features The processor is 
typically a digitally implemented classification algorithm 
which makes an appropriate decision based on the feature 
measurements provided by the output from the sensor Vari- 
ous types of ancillary data are also now typically used in the 
decision-making process 

At present the design of the processor algorithms is quite 
advanced and provides variety and flexibility for optimal per- 
formance given a feature set [4] , [5) However, the design of 
the best set of features is a complex matter which is not well 
understood There are many sensor parameters involved [11, 
ch 7] In the current work we limit considerations to the de- 
sign of the spectral aspects of features to make the problem 
more tractable, leaving other aspects to later occasions 
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Fig 1 ^ttern recognition system 


II Spictral Representation and Optimum 
Sensor Design 

In order to achieve an optimal design of a set of spectral fea- 
tures one must have suitao’e analytical representations for 

1) the spectral response of the scene, 2) the sensor system, 
3) the processor system, and 4) one must have a suitable, 
analytically expressible optimality criterion Further, we note 
the following factors which influence the creation of a spectral 
feature design procedure 

1) The scene is very complex in the fashion in which it re- 
flects and emits optical radiation Mathematical models which 
predict the scene radiant exitancc at least to the level of ac- 
curacy and precision needed for our problem, do not yet exist 
As a result an empirical scene model must be used 

2) Because satellite-borne sensor systems are very expensive 
they cannot usually be designed specifically for a ceitain use 
or user Rather they must be optimized with regard to a large 
number of scenes and uses (Fig 2) The feature space which 
the sensor defines must be adequately detailed, for example, 
such that in caily season when agricultut ll crop canopies have 
achieved only 10-1 5-percent cover, both the crop species 
mapping user and the soils mapper can be served This fact is 
important in the choice of optimality criterion, as will be seen 
shortly 

3) It is highly desirable that the spectral features be designed 
in such a way that they are maximally efficient in the sense 
that a feature set of any given size contain the maximum 
amount of useful information possible so that any given 
analysis can proceed with the smallest number of features 
possible There arc at least three reasons for this feature effi- 
ciency in lhis sense tends to decrease the amount of piocessor 
compulation icquircd, it tends to decrease the processor com- 
plexity required, and it tends to reduce the amount of training 
sample daia needed 

4) There are a number of constraints on the design of a 
sensor, generally of a practical character, which cannot rea- 
sonably be expressed analytically Examples arc those result- 
ing from optical design considerations, sensor material sensi- 
tivity curves, cost factors, spacecraft size and weight 
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Fig 2 A sensor system must be designed to perforin satisfactorily for 
many reasons scenes at various times of the season, i e , defining fea- 
tures spaces detailed enough to permit successful partitions for vary- 
ing applications 



Fig 4 Spectral response function for Mature Wheat collected on 
August 4, 1977, over Williams County, ND 



Fig 3 Sensor system design procedure 


considerations, etc , and especially from the interrelationships 
of these types of factors To mitigate this circumstance we will 
use the scheme depicted in Fig 3. That is, we will determine 
optimal sensor characteristics using entirely analytical means 
without regard for their physical realizability These charac- 
teristics will then serve as a guide by which to determine 
nearly equivalent but physically realizable characteristics which 
perform nearly as well 

In order to achieve the desired spectral feature design scheme 
we must deal specifically with each of these lettered and num- 
bered items above 

A The Scene Model 

Let us begin by considering the information bearing aspects 
of the spectral response function .v(X) (8] Tins response 
function (e g , from a single pixel) is proportional to the elec- 
tromagnetic energy received by the sensor as a function of 
wavelength X (Fig 4) Many factors determine the spectral re- 
sponse function for a given observation The irradiance of the 
sun, the conditions of the atmosphere, and the reflectance of 
the surface features all have important effects on the response 
Since a deterministic lelationship between the response func- 
tion and the many factors affecting it would be very complex, 
the set of functions which are observed m practice are best 
modeled as a stochastic process 

The ensemble of the stochastic process [13] will be defined 
in terms of the stratification necessary to apply pattern recog- 
nition methods to the earth observational problem A stratum 
S is defined as the largest contiguous area wluch can be classi- 
fied to an acceptable level of performance with a single tram- 


Slistum Tr* Ensembt* 



Fig 5 Realization of a stratum as the ensemble of spectral sample 
functions 

mg of the classifier It is noted that the sensor must be de- 
signed to operate satisfactorily over a large number of such 
strata, which vary greatly with time, location and application 
The collection of all possible strata which a sensor may ob- 
serve is denoted by S 0 Since the set S 0 is quite large, it is 
necessary to select a smaller subset which is representative in a 
statistical sense in order to perform the design 
The random experiment for the stochastic process consists 
of the observation of a point in a stratum S Each point in the 
stratum is mapped into a spectral response function (Fig 5) 
The collection of all response functions from a stratum defines 
an ensemble The ensemble plus the corresponding piobabihty 
measure defines the stochastic process [13] It is appropriate 
to assume a Gaussian probability measure for tlus process [3] . 

B The Scrsor System Model 

Next we choose a mathematical model for the sensor to 
represent the spectral response function for each observation 
Let the sensor be represented by a set of A ; filter functions or 
basis functions (0,(X)} such that the output of each filter is 
given by (Fig. 6) 

*,= I A<X)«,(X)rfX 0) 

J s\ 

The output of the senscr model is a sequence, 

x N } = X, which represents the spectral response by the 
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Fig 6 Sensor system model 



fidelity with which the output of the sensor represents the 
input We will choose the set {<£,(A)} such that for a given 
x(A) the approximation x(A) is as close as possible to the true 
spectral response function One may think of this approach as 
one intended to minimize the information loss through the 
sensor even though it cannot be known to the sensor designer 
what the information is Ir passing from x(A) to {*,} there is 
no information loss ifx(A) is recoverable from {*,} 

A common criterion for representation accuracy is the ex- 
pected mean-square representation error given by 


E U,} = 




[x(A) - x(\)] : 


However, it is desirable at this point to generalize this criterion 
by introducing a weight function w(A) on the spectral interval 
As will be seen, the weight associated with each X can be used 
to introduce into the analysis a pnon knowledge concerning 
the spectrum Thus (1) and (3) become [16] , 


I*i g 7 Approximation of the spectra! response function by a set of 
four basic functions 


approximation 

A:(X)=“A:i0i(X) + .’f 2 0j(X)+ ’ + Xjv0;v(X) 

- £ *,*,<X)-$(X) (2) 

1 = 1 

A simple illustration of the concept is given in Fig 7 How- 
ever, by relaxing the usual restrictions on the shape of the 
{ <P n ( A) } » considerable advantage can be obtained There is no 
theoretical or practical reason, for example, for the {$„(A)} to 
be nonoverlappmg What is needed is to determine the ordered 
set of basis functions which are optimal with regard to a mean- 
ingful system performance cuterion 

C The Optimality Criterion Including the Processor Model 
A key consideration is the choice of the optimality criterion 
It is desirable to optimize the sensor design with respect to an 
overall system (including the scene and processor) pci formance 
criterion The probability of correct classification is the cri- 
terion to be used here This choice is made because it is an ob- 
jective indicator of desired performance in a practical sense for 
a large proportion of applications, and it is perhaps the best 
studied and understood in a theoretical sense In selecting this 
performance measure there must also be associated with it a 
processor (classifier) model, in this case we chose the maxi- 
mum likelihood rule, for the same reasons 
However, because of the design factor pointed out in item 
(2) and Fig 2, we find it desirable to define an intermediate 
optimality criterion Because the sensor must function over a 
varied collection of strata using anv of a large collection of 
classifies, a criterion was chosen which is a measure of the 


x(A)0,(A)w(X)c/A 


E {e r } = E I J f*(A) - *(A)j 2 w(X) d Xj (3a) 

We want to choose the set of basis functions {<p, (A)} which 
is optimal with respect to the spectral icpiescnta^on criterion 
of expected mean-square error t {c r } Mere specifically, it is 
desired that the representation be complete in the sense that 
the expected mean-squaic error for any function in the en- 
semble be made arbitrary small simply by including enough 
terms, that convergence of the approximation to the original 
response be rapid in the first few terms, and, without loss of 
generality, we may also ask that the basis functions be 
orthogonal to each other 

A technique for deternrntng the set of optimal basts func- 
tions for an ensemble which satisfies the desired propones is 
based on the weighted Karhunen-Loeve expansion (2), [16], 
[17] The solution to the homogeneous linear integral 
equation is 

7,0,00= f *(X.U 0,0) K£) d\ (4) 

J A 

with the covariance function of the stochastic process, A' (A, f), 
as kernel is a set of eigenfunctions (<£,(A)} with corresponding 
eigenvalues y, If the eigenvalues are arranged in descending 
order, the cormsponding sequence of eigenfunctions can be 
used to form a linear combination of the eigenfunctions which 
converges to the original spectral response function with arbi- 
trarily small expected mean-square error Furthermore, be- 
cause of the ordering of the eigenvalues, convergence in the 
first few terms is very rapid This rap'd convergence allows 
truncation of the series expansion after a finite number of 
terms N with mean-square error minimized over all possible 
choices of N basis functions The mean square error is given by 

£ (er) = E 7, ( 5 ) 

/- l 
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Since ilie unmodified (w(\) = I) Karhuncn-Locve expansion is 
a well-studied technique and satisfies the desired properties for 
finding the basis functions, a sound analytical method is pro- 
vided for determining the optimal set of basis functions 

The optimal sensor design problem may be solved on a digital 
computer using empirical data taken by field measurements 
Some approximations must be made in order to take into con- 
sideration some practical constraints First the response func- 
tions arc not available as continuous functions but are ob- 
tained in the field by sampling the spectrum with an instru- 
ment that uses very narrow spectral window* Secondly, the 
parameters of the process are not known a priori, hence, it is 
necessary to estimate the mean and covariance functions using 
a representative sample from the ensemble Finally, because 
the data will be stored and processed digitally it is necessary to 
quantize the amplitude of the response at cacn of the spectral 
sample points Each of these constraints potentially can con- 
tribute to the representation error It has been shown that 
with reasonable care in selecting a sufficiently high spectral 
sampling rate, a large enough sample from the ensemble, and a 
large number of quantization intervals that the contribution of 
these factors to the representation error is small (16] The 
integral equation (4) becomes the matrix equation 

<8>r=MV<S> (6) 

where <5> is the mainx of eigenvectors, I is the diagonal matrix 
of eigenvalues, K is the covariance matrix, and W is the diago- 
nal matrix of weight coefficients 

111 RFLATIONSHir BhlWFrN TH1 SlM CTRAL 
Ri PR ESI NT ATI ON *N!> SYSIhM PER hOKM ANC t 

The performance of the overall system is ultimately what we 
wish to optimize For this purpose, as previously indicated the 
probability of correct classification P c lias been chosen as the 
performance indicator to be optimized If the vector X is an 
observation from one of M classes C i% t = I, 2, , M with 

a priori probabilities P t , the probability of correct classifica- 
tion, using the maximum likelihood rule is given by 

= f n \ JX {PiP( x \C,))dX (7) 

where p(Y|C,) is the conditional (multivariant) probability 
density function lor class i The integral in (7) is over the 
observation space 

The analytical procedure based on (he weighted Karhunen- 
Loeve expansion has prescribed a sensor design which mini- 
mizes the mean-square representation error One would like 
to know how the ability to represent a process influences the 
classification performance To study this relationship the 
graph of the probability of corcct classification versus ex- 
pected mcan-squav error is introduced (Fig 8) We will 
briefly discuss some of its characteristics 

The addition of terms to the series expansion causes a 
monotonic decrease in the spectral representation error, but 
the effect of the additional terms on the overall system per- 
formance must be determined It can be shown that increasing 
the number ol terms in the representation will never decrease 
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1 ig 8 Probabfriy of a correct classification as a furwuon of exacted 
mean-square representation error 

the performance provided that the stochastic process is com- 
pletely known If after N terms the improvement in per- 
formance is small compared to the reduction in representation 
eiror, then the representation is sufficient This is illustrated 
by case A of Fig 8 m which the Threshold T indicates the 
minimum required P [e r ] However, if the performance is 
showing significant improvement for a small decrease in the 
mean-square error, case B of Fig 8, mo.e terms are necessary 
to complete the representation 

Since the parameters of the stochastic process must be esti- 
mated from \ sample of the ensemble, the effect of the size of 
the sample relative to the dimensionality of the system is im- 
portant Hughes [10] has shown that if the sample size is too 
sit id 11 , the classification performance may actually be degraded 
by adding terms to the expansion Thus it is necessary to 
maintain a large set of sample functions from which to esti- 
mate the statistics 

The choice of information classes also influences the per- 
formance of the pattern recognition system For purposes of 
classifying the data .'to distinct classes it is required that the 
class list have the following properties simultaneously [II] 

1) Each class must be of interest to the user, i e , of in- 
formational value 

2) The classes must be separable in terms of the features 
available 

3) The list must be exhaustive, i e , there must be a class to 
which it is logical to assign each pixel in the scene 

The classes may be arranged in a lucrarchial tree structure in 
which case classes deeper in the tree require different repre- 
sentation accuracies to achieve a given level of classification 
performance 

The area of the ground resolution element is determined by 
such system characteristics as the instantaneous field-o r -vicw 
(IFOV), the altitude of the sensor, the scan rate, and the ve- 
locity of the sensor These are examples of spatial representa- 
tion parameters The size of the objects which can be identi- 
fied and the energy' available are influenced by the choice of 
ground resolution element size. If a typical object which one 
wishes to identify is smaller than the ground resolution ele- 
ment size, then it is very difficult to classify that object 
Mobasscri [12] has investigated the effect of the area of the 
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resolution clement on classification performance Increasing 
the area often improves the signal-to*noise ratio which in turn 
can improve the classification performance 
For a given remote sensing problem the signal is the part of 
the received response which is information bearing, and the 
noise is that part which is nonmformation beating The in' 
fluencc of the s*gnal-to-noisc ratio where the noise is white, 
Gaussian and additive was demonstrated by Ready et al [14] 
Results show that overall classification performance decreased 
with an increase m the noise level A class which was difficult 
to identify under low noise level conditions suffered the most 
degradation when noise was added 

IV Experimental Sys^m 

An experimental software system has been set up to imple- 
ment the analytical procedure that has been developed The 
software system has been implemented on an IBM 370 com- 
puter at the Laboratory for Applications of Remote Sensing 
(LARS) at Purdue University, Lafayette, IN 
A collection of field data consisting of spectral response 
functions on three dates from Williams County. ND, and three 
dates from Finney County, KS, was available from the field 
measurements library at Purdue/LARS More than one thou- 
sand spectra were available from each location and collection 
date The response functions were sampled in wavelength 
using narrow windows of 0 02 pm over the range 0 4 < 
X < 2 4 urn 

The optimal set of basis functions is found numerically by 
estimating the covariance matrix from the sample response 
function Maximum likelihood estimates of the mean and 
covariance matrices are given by 


_ ~ I n . 

X = F iX)*X~-£X l 

n S 1=1 


and 


1 ** 


k *7fI. W,-X){X,-X) t 

N > 7m 


■J(X- X) 


TABLE I 

Spectral Band Locations for Tuo Practical Sensor Designs* 



Siwor 

1 


Seu»ot Huab«r 

2 


Item length 

Uad 

L’*ve tenjtti 

V 


to 

X . 

uk 

k 

l u «• 


1 

3 

to 

6m 

1 

45 to 

52m 

2 

.6 

to 

7l» 

2 

52 to 

60 v« 

3 

7 

to 

6l» 

3 

63 to 

69m 

4 

8 

to 

1 lm 

4 

76 to 

90m 





3 

i 55 to I 

75m 





6 

2 08 to 2 

35 m 


•The band edges of these sensors *ere selected to coincide with the 
nominal bandwidths of the MSS of Landsats 1. 2 . and 3, and those of 
the TM of Landsal D 


characteristic could be simulated, most of the sensors which 
we.e simulated consisted of a small set of rectangular basis 
functions, i e , 


MX)- 


10 . \ lk <\<\ uk 

0 0, otherwise 


( 11 ) 


( 8 ) 


( 9 ) 


where is the number of sample functions available and X, is 
the ah sample vector The covariance is the kernel in the 
linear integral cquition whose solutions are the optimal basis 
functions or eigenvectors A remarkably stable and accurate 
method lor numerically computing the eigenvalues and eigen- 
vectors was published by Grad and Brebner |6| 

The eigenvectors arc used to perform the linear transformation 


( 10 ) 


on the data set The class conditional statistics arc computed 
using the transformed data 

In order to compare the performances of two systems an 
algorPhm which estimates the probability of correct classifica- 
tion for an M class problem, given the class conditional multi- 
variate Gaussian statistics, was used [16| This algorithm, 
which is based on the stratified posterior estimator [15], was 
found to be accurate within one-half of 1 percent 

The experimental system also included an ability to simu- 
late (suboptimal) practical sensors Although nearly any sensor 


where the \ tk and \ uK are the lower and upper limits of the 
spectral bands The spectral bands of two suboptimal sensors 
which were implemented are listed in Table I 

V Results 

It became necessary during this research to create a means 
for incorporating certain types of ancillary information into 
the optimisation process The reason for this can best bv illus- 
trated with an example We first determine the optimal fea- 
tures, {<£*(X)1, for a data set using the uniform weight func- 
tion of Fig 9 Plots of the first four eigenvectors for tins case 
arc shown in Fig 10 Now the shape of an eigenfunction is 
instructive in this application because whenever an eigenfunc- 
tion is large (either positive or negative) a! a given X, it implies 
•that that wavelength is important ui representing the given 
ensemble and, therefore, perhaps also in classifying tt 

It was observed in this example that the eigenvectors are 
dominated at least in the case of k = 1 and 4 by components in 
the bands near 1 4 and 1 9 pm These bands are dominated by 
water absorption m the atmosphere, and it is known a prion 
that very little solar flux reaches the earth’s surface in them 
Thus these regions near I 4 and I 9 pm represent noise rather 
than signal, and under test, the eigenvectors which were sensi- 
tive to these bands contributed very little to the classification 
performance Using a second weight function which assigns a 
very small weight in the water absorption bands (Fig 1 i) the 
influence of ihe spectrum in the water absorption bands is 
significantly reduced, and a marked improvement in classifica- 
tion performance (eg, a classification e r ror rate reduction 
from 19 6 percent to 1 1 8 percent for a six-feature classifica- 
tion) is observed 

It is a characteristic of the Karhuncn-Loeve representation 
that spectral intervals with the greatest ensemble vanation are 
given greatest emphasis Some of this ensemble vanation may 
be expected io be information-beanng (signal) and some not 
(noise). As can be seen by this example, the weighting func- 
tion provides a convenient means to incorporate into the fea- 
ture design pnor knowledge about whether ensemble varia- 
tions in a given range are signal or noise In the remainder of 
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Fig. 9 Weight function number 1 
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Fig 10 First four optimum basis functions using weight function 
number 1 over Williams County, May 8, 1977, data 



Tig 11 Weight function number 2 

the results presented, the weighting function of Fig 1 1 was 
used Two other weighting functions have been tested [16] , 
tiie choice of which is the best weighting function is thought 
to depend on a number of factors not all of which are well 
understood 

Results for the design of the optimal sensor for one of the 
six sets of data are presented in Figs 12-14. 1 The data from 
the stratum was collected over Williams County, ND, on May 
8, 1977 The three classes represented in this d*ta set were 
spring wheat, summer fallow, and pasture Since the data were 
taken relatively eaily in the growing season, the wheat canopy 
was still quite sparse which leads one to expect the wheat class 
to resemble spectrally bare sod or fallow In Fig 12 the ex- 
pected mean square error is plotted as r. function of the num- 


ber of terms in the expansion Rapid convergence in the first 
few terms is demonstrated in this graph 

The first eight weighted eigenvectors are plotted in Fig 13 
Note that the magnitude of the first eigenvector has the gen- 
eral shape of a typical spectral response of soil, since the pre- 
dominant response in two of the three classes is soil, this first 
eigenfunction may be thought of as the average response over 
the ensemble The second eigenvector tends to divide up the 
spectrum into three to four relatively broad regions As the 
number of terms in ihe expansion is increased, the terms that 
are added require higher spectral resolution to reduce mean- 
square error, indicating that spectral fine structure may be in- 
creasingly important. Wc will comment further on tlus later. 

In Fig 14 the estimated probability of correct classification 
is graphed as a function of the expected mean-square error It 
is seen that at least the first eight basis functions contribute 
significantly to the classification accuracy The numbers in 
parentheses on Fig 1 3 indicate the order in which the features 
were chosen by a feature selection processor 2 The relatively 
broad bands of & 2 W arc indicated as of greatest importance 
to class discrimination but aoparently rather quickly there- 
after the finer spectral structure represented by 0 6 (\) be- 
comes important 

To complete the design illustration, the procedure was used 
to determine a set of physically realizable rectangular spectral 
bands Six to ten bands were required based on a study of the 
dimensionality An initial *el of band edges was chosen based 
on an examination of the eigenvectors of the May 8 data set 
From (la) it is observed that regions of the spectrum that are 
significantly different from zero for a particular eigenvector 
may contain important information This, together with the 
feature ordering determined by a standard feature selection 
algorithm, indicates spectral mtcivals important for discrimi- 
nation Zero crossings of the eigenvectors may be possible in- 
dicates of band edges 

Based upon such a study of the optima* * unctions a set of 
rectangular spectral bands was chosen. The final set of band 
edges for the example design is presented in Table II In 
Fig 15, comparison of the two simulated sensors of Table I 
and the optimum set is made on the basis of classification per- 
formance for the May 8 data The first ten optimal basis func- 
tions were used from the optimal design, where 10 features 
provide a small mean-square error yet keep the computational 
difficulties involved with high dimensional systems to a rea- 
sonable level As shown in Fig 15, the four-band sensor 1 
compares rather poorly with the best four 3 optimal basis func- 
tions, showing sensor 1 to be far from optimal for this set of 
classes Sensor 2 is an improvement over sensor 1 , but still 
somewhaJ lower in performance than desirable The eight- 
band example design, however, demonstrates very good per- 
formance relative to the performance of the ten feature 
optimal set. Also, the expected mean-square error is signifi- 
cantly reduced over that of sensors 1 and 2 It should be 


‘The remits for the other five data sets, which were from different 
times of the year at the North Dakota site and a Kansas site, are con- 
tained in ( I6j 


a Thc feature selection scheme is based upon maximizing the average 
imerc lass- pair divergence The details are in [16] 

3 Thc term “be**" refers here to those chosen by a feature selection 
algorithm ss best for discriminating between the design set classes 
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Fig 14 Estimate of probability of correct classification versus ex- 
pected mean-square error Williams County, May 8, 1977, using weight 
function number 2 


Fig 15 Comparison of sensor performance WiUiams County, May 8, 
1977 


Fig 16 Comparison of sensor performance Average of six cases 


TABLE I! 

Spectral Band Locations for the Example Design 


43y« - J**n 
56*t» - 66*xa 

68*oi - 70 ia 
72u» - 90 jn 

92 *» - 1 00** 
1 02** - 1 30m 
1 33*o - 1 74*o 
1 93*o - 2 40m 


pointed out that these comparisons of classification per- 
formance are based on a particular set of information classes 
and the results may be different if different classes were 
chosen The reduced representation error may imply in- 
creased robustness in performance for differing information 
classes 

The performance was evaluated for the six data sets and the 
average performance for each of the sensors is shown in Fig 
16 Again the experimental sensor is quite dose in perfor- 
mance to the optimal, a difference of about one percent. 
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fig 17 Correlation coefficients versu' wavelength for Williams County, May 8, 1977 


Although the primary purpose of this work was to develop a 
design procedure for sensors, important contributions to the 
understanding of the scene can be gained Properties such as 
signal dimensionality for a given stratum, maximum possible 
cla*s separability, the spectral resolution required, and the 
accuracy of spectral representation required to obtain a given 
level of performance can be studied 
The dimensionality of the ensemble can be defined as the 
number of terms in the expansion necessary to represent the 
original waveform to the desired accurac) For the in forma- 
lion classes used on the six data sets the dimensionality was 
between six and eight Some of the strata required fewer 
terms in the expansion to obtain a good representation and 
good performance while others required more terms 
A review of the eigenvectors of Fig 13 shows an unexpected 
amount fine structure, particularly in the region from 0 66 
to I 26 jmi as previously noted Up to this time it had often 
been observed that spectral bands very near to one another 
tend to be highly correlated and thus redundant To investi- 
gate this structure a bit further the correlation coefficient be- 
tween each band of width 0 02 jim and its nearest neighbor 
was calculated The results are shown in Fig 17 Based upon 
previously known results one might expect the graph of the 
figure to deviate little from unity The fact that it does de- 
viate significantly from unity changing significantly in very 
narrow bands may be further evidence that there is useful in- 
formation in the spectral fine structure 


VI Conclusion 

The design of the spectral bands for multispectral instru- 
ments has been an important matter for a number of years In 
the past such design has of necessity been carried out as a 
spectral band selection problem using in a subject’ >e manner 
the direct application of experience and judgment Nonover- 
lapping bands were selected one-by-one based upon the cur- 
rent understanding of reflectance phenomena Each band was 
generally selected on its own individual meters because evalu- 
ating sets of bands is difficult to do without computational 
aids A typical example of this process was the design of the 
Thematic Mapper bands [7] Although this procedure has 
served adequately to thr time, as the level of sophistication of 
applications and, therefore of data needs continues to grow, it 
seems clear that more quantitative and objective design tools 
are needed It was this need to which the work reported 
herein was directed 

As has been seen the design technique which resulted 
utilizes an empirical stochastic scene model as the best cur- 
rently feasible means for defining quantitatively the scene and 
its information and noise content Since not all sensor system 
limitations can be analytically specified, the technique utilizes 
an optimal design calculation but it still requires engineering 
judgment as the best means for resolving the tradeoff of more 
tangible factors with those winch are less tangible It is by no 
means intended as an “automatic” design procedure 
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The optimal design calculation was established utilizing an 
overall performance measure, probability of correct classifica- 
tion, and an intermediate one, spectral representation error 
averaged over the ensemble. One can argue that it is overall 
system performance which is, after all, the most important. 
However, a specific performance measure calculation can only 
be earned out with regard to a specific set of classes Different 
applications frequently require the same data set to be classi- 
fied into different class sets; thus ore desires robustness with 
regard to various class sets as well m a feature set. The hy- 
pothesis used here is that a concern for representation ac- 
curacy will tend to insure thi^ robustness, thus the bilateral 
approach to performance index 
The demonstration presented here of the design technique 
appea.s to warrant the conclusion that the calculation pro- 
cedure provides a substantial aid to the overall design process 
With relatively limited effort a feature set with good overall 
performance was obtained On the other hand, the band set 
given in Table II should by no means be regarded as worthy of 
consideration for implementation as is The ensemble defini- 
tion and sampling were not an important point in this work 
and would require much more detailed and careful con- 
sideration than they received here The same is true of the 
band set and its comparison to alternative choices 
And finally, we again draw attention to the use of the tools 
of this design procedure for carrying out more basic research 
into understanding the scene itself and its spectral cnaracteris- 
tics By being able to determine and quantitatively assess the 
information-bearing attributes of scene spectral characteristics, 
the ability to study potentially important characteristics of 
the scene is greatly facilitated 
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